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Abstract — In  this  work,  the  accuracy  of  soil  moisture  retrieved 
from  ASAR  data  over  bare  or  sparsely  vegetated  surfaces  is 
investigated  by  means  of  a  simulation  study.  The  soil  moisture 
retrieval  method  is  based  on  an  optimization  algorithm  that 
appropriately  inverts  theoretical  direct  models  by  assimilating  a 
priori  information  on  surface  parameters.  In  order  to  account  for 
a  large  variability  of  roughness  conditions,  two  complementary 
models  have  been  used,  namely  the  Integral  Equation  Method 
model  and  the  Geometrical  Optics  model.  The  performance  of 
the  inversion  method  has  been  assessed  on  simulated  noisy  ASAR 
data,  as  a  function  of  different  a  priori  information  quality  level. 

Soil  moisture;  SAR;  IEM  model;  GO  model;  Inversion 
method; 

I.  Introduction 

The  dependence  of  SAR  backscattering  on  soil  moisture 
and  surface  roughness,  over  bare  fields,  has  been  widely 
studied  and  modeled  [1].  Many  experiments  confirmed  a  good 
sensitivity  of  radar  measurements  to  soil  moisture  conditions. 
These  observations  make  sensible  to  address  the  inverse 
problem,  i.e.  retrieving  soil  moisture  from  backscattering  data. 
However,  the  experience  gained  with  single-parameter  SAR 
systems,  such  as  ESR-1/2,  has  shown  that  soil  moisture 
estimates  are  affected  by  large  errors  [2],  This  is  mainly 
because  many  combinations  of  soil  moisture  and  soil  roughness 
parameters  correspond  to  the  same  measured  backscattering 
value.  In  this  respect,  improvements  of  soil  moisture  estimation 
accuracy  can  be  obtained  principally  by  using  multi-parameter 
radar  measurements  and  a  priori  information  on  surface 
parameters.  The  accuracy  of  retrieved  soil  moisture  also 
depends  on  the  inversion  algorithm  adopted.  Algorithms  based 
on  the  inversion  of  theoretical  models  are  preferred  to 
empirical  relationships  because  the  latter  are  valid  for  limited 
and,  often,  site-dependent  data  sets.  Unlike,  model  based 
algorithm  can  be  adapted  to  a  wider  range  of  surface 
conditions.  In  this  study,  the  Integral  Equation  Method  (IEM) 
and  the  Geometrical  Optics  (GO)  models  are  employed, 
because  they  are  applicable  to  roughness  conditions  going  from 
smooth  to  very  rough  ones  [3]. 

Another  factor  that  can  be  exploited  to  improve  the  soil 
moisture  accuracy  is  the  quality  level  of  a  priori  information. 


The  a  priori  information  consists  of  tentative  knowledge  about 
the  soil  roughness  and  soil  moisture  conditions.  Prior 
information  on  soil  moisture  content  may  come,  for  example, 
from  networks  of  ground  stations  or  form  land  process  models. 
Whereas,  initial  guess  values  on  surface  roughness  may  be 
gathered  from  site  agricultural  calendar  or  from  simple 
empirical  relationships  between  SAR  backscattering  and 
roughness  state.  The  a  priori  information  is  of  high  quality 
level  when  it  is  close  enough  to  the  true  solution.  When  this 
occurs,  more  accurate  estimates  of  soil  moisture  are  possible  by 
constraining  the  optimization  algorithm.  This  corresponds  to 
search  solutions  close  to  the  a  priori  information,  instead  of 
searching  within  the  whole  parameter  space. 

In  this  study,  following  a  methodology  as  in  [4],  a  Bayesian 
framework  is  adopted  to  derive  an  optimization  algorithm, 
which  estimates  soil  moisture  content  from  HH&VV  SAR 
backscattering  values  at  C-band.  The  algorithm  inverts  IEM 
and  GO  models  appropriately  assimilating  a  priori  information 
on  surface  parameters. 

In  section  II,  the  soil  moisture  retrieval  problem  and  the 
optimization  algorithm  are  shortly  introduced.  Then,  the 
simulated  data  set  is  presented.  Subsequently,  the  dependence 
of  the  retrieved  soil  moisture  accuracy  on  the  simulated  SAR 
data  configurations,  measurement  errors  and  different  quality 
level  of  a  priori  information  is  investigated.  Finally, 
conclusions  are  given. 

II.  THE  SOIL  MOISTURE  RETRIEVAL  BY  MODEL  INVERSION 

The  relationship  between  multi-dimensional  (i.e.  multi¬ 
polarization,  multi-angle,  etc.)  radar  backscattering  coefficient, 
a o,  and  surface  geophysical  parameters ,p,  can  be  modeled  as: 

oJ  =  ftp),  (1) 

where /(  )  is  a  direct  theoretical  model.  In  this  paper,  bare  or 
sparsely  vegetated  surfaces  only  will  be  considered.  Then, 
direct  theoretical  models  are  surface  scattering  models,  namely 
IEM  and  GO. 

The  geophysical  parameters  which  characterize  the  soil 
surface  are:  the  complex  soil  dielectric  constant  A  the 
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roughness  parameters  expressed  in  terms  of  the  root  mean 
square  (rms)  height  of  soil  surface  s,  and  the  correlation  length 
/,  associated  to  the  soil  surface  autocorrelation  function  (ACF). 
The  ACF  is  assumed  exponential  when  1EM  surface  scattering 
model  is  used.  Whereas  it  is  Gaussian  when  the  GO  model  is 
employed.  In  this  case,  the  roughness  parameter,  which 
characterizes  the  very  rough  surfaces,  is  the  surface  rms  slope 
m  (i.e.  m  =  V2  s  / 1)  instead  of  independently  s  and  1. 

For  agricultural  soils,  the  imaginary  part  of  the  soil 
dielectric  constant  I(fO,  is  usually  much  smaller  than  the  real 
part  R(A).  Therefore,  to  reduce  the  number  of  independent  soil 
parameters,  we  have  approximated  the  imaginary  part  of  the 
soil  dielectric  constant  as  I(£,)=R(£f)/10.  In  the  inverse 
problem,  the  volumetric  soil  moisture  content  mv, 
corresponding  to  a  soil  layer  about  5  cm  thick,  is  obtained  first 
by  estimating  the  soil  dielectric  constant  and  then  by  inverting 
the  empirical  expression  derived  by  Flallikainen  et  al.  [5]. 

In  this  context,  our  goal  is  to  find  the  “best”  estimate  of  soil 
moisture  and  soil  roughness  p  which  inverts  (1),  given 
FIFl&VV  SAR  measurements  ad"  and  prior  estimates  of  soil 
parameters  p".  The  SAR  observations  ad”  differ  from  the 
theoretical  backscatter  ad  due  to  measurement  errors.  In 
addition,  the  non-linear  relationship  (1)  between  predicted  SAR 
backscatter  a,,'  and  geophysical  parameters  p  may  also  be 
affected  by  model  errors.  Using  the  Bayes’s  theorem,  the 
probability  density  function  (pdf)  of  p  conditioned  by  the 
observed  ad"  is 

P(P|  <0  «  P(<70»  P(p\p\  (2) 


where  P(ad"\p)  is  the  pdf  of  ad"  ■  knowing  that  the  geophysical 
parameters  are  p,  and  P{p\p")  is  the  pdf  of  p,  given  the 
knowledge  of  its  prior  estimate  p".  To  obtain  the  “best” 
estimate  of  p  either  the  mean  or  the  mode  of  P(p\ad"),  which 
are  the  minimum  variance  and  the  maximum  likelihood 
estimates  of p,  can  be  used,  respectively.  It  is  worth  mentioning 
that  if  the  pdf  in  (2)  are  Gaussian,  the  minimum  variance  and 
the  maximum  likelihood  estimators  coincide. 

Under  the  hypothesis  that:  SAR  observations  a0m  and 
parameters  p  are  related  by 

<=m+  (3) 

where  d11'1’  is  a  zero  mean  Gaussian  uncorrelated  vector, 
representing  the  cumulative  effect  of  measurement  and  model 
errors;  the  geophysical  parameters  p  distribute  around  a  guess 
value p° ,  i.e. 

p=p°+Ep,  (4) 

where  ep  is  a  zero  mean  Gaussian  uncorrelated  error.  Then,  the 
Bayesian  maximum  likelihood  estimator  of  p  for  a  generic 
observed  ad"  and  subject  to  prior  estimate  p  is  obtained  by 
maximizing  P(p\ad")  or  equivalently  by  minimizing  the 
functional 


J(p)=VW-flp))T  S'1  (ad'-Jlp))  +  1/2  (p-p°f  G-1  (p-p° ),  (5) 


where  S  and  G  are  diagonal  covariance  matrices  of  ad"  and  p, 
respectively  [4].  An  analytical  solution  which  minimize  J (p) 
does  not  exist,  then  an  iterative  approach  has  to  be  sought.  An 
efficient  and  quite  accurate  algorithm  exploited  to  minimize 
J(/j)  is  the  Generalized  Reduced  Gradient  Method  [6].  When 
no  prior  information  on  the  geophysical  parameters  is 
available,  the  pdf  P(p\pn)  in  (2)  may  be  modeled  as  a  constant 
term,  which  can  be  dropped  (i.e.  it  is  incorporated  into  the 
normalization  constant).  Then  the  functional  J (p)  in  (5)  reduces 
to  its  first  term  only,  which  represents  the  traditional 
minimization  of  the  mean  square  errors  between  measured  and 
predicted  backscattering  values,  i.e. 

J(p)=l/2(a0m-f[p))T  S1  (< a0m-flp )).  (6) 

III.  DEPENDENCE  OF  THE  RETRIEVED  SOIL  MOISTURE 
ACCURACY  ON  MEASUREMENTS  AND  A  PRIORI  INFORMATION 
ERRORS 

In  this  section,  a  simulation  study  on  the  impact  of 
measurement,  model  and  a  priori  information  errors  on  the 
accuracy  of  soil  moisture  and  soil  roughness  parameters 
retrieved  using  C-band  SAR  data  is  carried  out.  The  attention 
has  been  focused  on  the  ENVISAT  ASAR  configuration  in 
which  both  VV  &  HH  polarizations  are  acquired  at  23° 
incidence  angle.  For  the  sake  of  comparison,  some  plots 
include  both  the  traditional  ERS  SAR  configuration  (i.e.  VV  at 
23°  incidence  angle)  and  a  virtual  configuration  corresponding 
to  HH  &  VV  polarizations  acquired  contemporary  at  23°  and 
45°  incidence  angles.  SAR  measurements  have  been  simulated 
by  the  IEM  and  GO  models.  Since,  GO  predictions  at  HH  and 
VV  polarizations  are  equal,  VV  polarization  is  considered  only. 
The  selected  model  input  parameters  describe  a  wide  range  of 
roughness  and  moisture  conditions,  i.e.  from  smooth  to  very 
rough  surfaces  and  from  extremely  dry  to  wet  soils. 

The  IEM  data  were  simulated  by  using  the  following 
intervals  of  values:  relative  dielectric  constant  d  =  [3.0+j0.3, 
20.0+j2.0],  corresponding  approximately  to  a  volumetric 
moisture  content  interval  of  mv%  =  [3%  cm3  cm'3,  38%  cnf 
cm'3];  roughness  vertical  standard  deviation  s'  =  [0.6  cm,  2.1 
cm];  Exponential  Auto  Correlation  Function  (ACF)  with 
correlation  length  /  =  [6  cm,  24  cm].  The  GO  data  were 
simulated  by  using  the  same  range  for  d ,  but  higher  values  for 
the  roughness  s.  The  range  of  the  slope  m  used  is  the  interval 
[0.2, 0.3],  s>  3.0. 

To  quantitatively  study  how  measurement,  model  and  a 
priori  information  errors  propagate  on  the  retrieved  parameter 
accuracy,  noisy  data  have  been  simulated.  For  the  sake  of 
simplicity  measurement  and  model  errors  have  been 
represented  as  a  unique  total  error,  i.e.  d"°  in  (3).  Then,  the 
noisy  data  have  been  obtained  by  adding  on  the  theoretical 
IEM  and  GO  backscattering  values,  a  zero  mean  Gaussian 
random  noise  with  increasing  standard  deviation,  denoted  as 
(f,  which  ranges  between  0.5  dB  and  2.0  dB. 


The  error  on  a  priori  information,  i.e.  ep  in  (4),  has  been 
simulated  as  a  Gaussian  uncorrelated  noise,  and  it  has  been 
added  to  the  true  parameters,  to  obtain  the  input  guess  value, 
i.e.  p°.  The  quality  level  of  a  priori  information  depends  on  the 
standard  deviation  d  of  the  added  ep  error.  Four  different 
quality  levels  of  a  priori  information  have  been  considered  and 
reported  in  Tab.  I.  The  data  set  labeled  as  guess  mean  contains 
guesses  with  lowest  quality  and  has  been  simulated  simply  by 
taking  as  a  guess,  the  mean  value  of  the  range  of  each 
parameter.  Whereas,  the  data  sets  labeled  as  pb30,  pb20  and 
pblO  contain  guesses  of  increasing  quality.  More  precisely,  the 
d  values  have  been  set  equal  to  the  30%,  20%  and  10%  of  the 
whole  variability  range  of  each  parameter,  as  summarized  in 
Tab.  II.  It  follows  that  the  highest  quality  of  a  priori 
information  is  represented  by  the  data  set  pblO,  because  it 
contains  values  closer  to  the  expected  parameter. 

Given  a  (noisy)  measurement  and  using  a  guess,  it  is 
expected  that  the  optimization  algorithm,  after  some  iterations, 
can  fit  the  measurement  and  then  returns  an  estimated 
parameter  closer  to  the  true  one.  It  follows  that  the  error  in 
terms  of  discrepancy  between  the  estimated  and  expected 
parameters  reduces  from  an  initial  value  (computed  on  the 
guesses)  to  a  final  value  (computed  on  the  output  parameter 
returned  by  the  algorithm).  The  initial  rms  errors  associated  to 
pb30,  pb20  and  pblO  are  given  by  the  respective  d  values 
shown  in  Tab.  II.  Whereas,  those  associated  to  the  data  set 
guess  mean,  are:  s:  rms=0.45  cm;  /:  rms=5.48  cm;  nr. 
rms=0.03;  d  (real)  rms=5.05;  and,  mv%  cm3  cm3  rms=9.28. 

In  order  to  better  understand  the  need  of  introducing  a  priori 
information  into  soil  moisture  retrieval  algorithms,  we  firstly 
present  in  Fig.  1  the  behavior  of  soil  moisture  estimates 
obtained  by  minimizing  the  cost  function  (6).  The  diagonal 
terms  of  the  covariance  matrix  S  have  been  set  equal  to  the 
variance  of  the  noise  £"*.  As  guess  values,  the  data  set  guess 
mean,  i.e.  the  mean  values  of  the  parameter  range,  has  been 
used.  This  basically  corresponds  to  the  retrieval  of  soil 
moisture  from  backscattering  values  without  any  a  priori 
information. 

TABLE  I.  Guess  data  sets  simulated  with  different  quality 

LEVELS  ADOPTED  IN  THE  RETRIEVAL  ALGORITHM  AS  A  PRIORI  INFORMATION. 

£P=Expected  Parameter  value,  PR=Parameter  Range,  ^a=zero  mean 
Gaussian  random  variable,  (/^standard  deviation  of  rjK 


Data  set 

Guess 

Perturbation 

guess  mean 

Mean  value  of  range 

- 

Db30 

EP±rj3 

cG=30%  PR 

Db20 

EP±J]2 

cf ,=20%  PR 

o  bit) 

EP±t]i 

<fr =10%  PR 

TABLE  II.  Standard  deviation  cf  used  to  simulate  the  guess  for 

EACH  PARAMETER  OF  THE  IEM  AND  GO  MODELS.  THE  (f  VALUE  OF  THE  m,% 
PARAMETER  IS  ESTIMATED  FROM  £  BY  THE  EMPIRICAL  RELATIONSHIP. 


Data  set 

s  (cm) 

/  (cm) 

m 

f  (real) 

(mv%  cm3 cm'3) 

pb30 

cf=0.45 

cf =5. 40 

</=o.o3 

cf=  5.10 

(<f= 9.60) 

pb20 

cf  =0.30 

cf= 3.60 

cf =0.02 

cf= 3.40 

(cf=6.80) 

pblO 

</=o.i5 

cf  4.X0 

cf =0.01 

</=i.70 

(<7°=3.50) 

Fig.  1  shows  the  rms  error  of  mv%  cm3  cm'3  (i.e.  A mv%)  as 
a  function  of  the  standard  deviation  d  of  the  noise  for 
different  SAR  data  configurations  (VV  23°,  HH&VV  23°, 
HH&VV  23°&45°).  As  can  be  seen,  the  A mv%>  error  increases 
as  a  function  of  d  and  decreases  going  from  the  single¬ 
parameter  configuration  (i.e.  ERS,  VV  23°)  to  the  multi¬ 
parameter  configuration  (i.e.  FIH&VV,  23°&45°). 

When  d  is  equal  to  zero,  there  is  considerable  difference 
between  A mv%  errors  of  the  different  SAR  configurations. 
However,  as  soon  as  d  increases,  the  performances  of  the  three 
configurations  tend  to  get  closer  and  closer.  A  strong 
sensitivity  of  A mv%>  to  d  is  observed.  In  particular,  starting 
from  d  ~  1  dB  the  ERS  and  the  ENVISAT  configurations  are 
characterized  by  almost  the  same  performances  in  terms  of 
A mv%  error.  Since  it  is  unrealistic  to  have  a  total  budget  error 
significantly  smaller  than  1.0  dB,  a  priori  information  needs  to 
be  assimilated  into  the  retrieval  algorithm  to  benefit  from  the 
ASAR  double-polarized  configuration. 

The  attention  is  now  focused  on  the  performances  of  the 
retrieval  algorithm  using  a  priori  information  obtained  by 
minimizing  the  functional  in  (5).  The  diagonal  terms  of  the 
covariance  matrices  S  and  G  and  have  been  set  equal  to  the 
variance  of  the  noise  d0  and  ep,  respectively.  The  latter  are 
the  values  ( d )2  as  reported  in  Tab.  II. 

Fig.  2  shows  the  A mv%  error  of  as  a  function  of  the  guess 
quality  level  (related  to  the  standard  deviation  d)  for  different 
d  values  of  the  noise  dut>  of  a  typical  ENVISAT  configuration 
(i.e.  HH  &  VV  at  23°  incidence  angle).  Moreover,  the  initial 
guess  rms  errors  are  shown  by  the  curve  plotted  with  the 
continuous  line.  As  can  be  seen,  the  retrieval  algorithm 
reduces  significantly  the  initial  A mv%  error  from  about  9.6  to 
the  final  error  of  5.9  (at  d=  1.5  dB),  when  a  priori  information 
pb30  is  used.  This  rms  error  can  be  further  reduced  by  using 
backscattering  with  a  lower  noise  level  (for  example  d=0.5 
dB)  or  by  using  a  priori  information  with  a  higher  quality  level 
(for  example  pblO).  In  the  first  case,  A mv%  reduces  up  to  4.4. 
In  the  second  case,  it  decreases  to  2.8.  This  means  that  the 
quality  level  of  the  a  priori  information  is  the  main  factor  that 
can  significantly  reduce  the  A mv%  error.  These  final  A mv% 
errors  slightly  increase  with  the  noise  increasing  thus  indicating 
a  strong  robustness  of  the  algorithm  versus  d .  More  precisely, 
increasing  the  d  up  to  2.0  dB,  A mv%  increases  less  than  2% 
for  pb30  and  less  than  1%  for  pblO. 

In  order  to  investigate  the  presence  of  significant  bias  on 
the  retrieved  soil  moisture,  a  linear  fit  between  expected  and 
retrieved  soil  moisture  values  has  bee  carried  out.  Given  the 
linear  model  y=A+Bx,  (where  x  and  y  represent  the  expected 
and  retrieved  mv  values,  respectively)  the  parameter  A  and  B 
have  been  estimated,  and  are:  A=2.40%  cm3  cm"3,  B=0.92.  Fig. 
3  shows  the  scatter  plot  of  mv%  values  obtained  considering  the 
HH&VV  at  23°  incidence  angle  configuration,  a  noise  of  1.0 
dB  and  initial  guess  values  correspondent  to  a  priori 
information  correct  within  20%  of  the  each  parameter  range 
(i.e.  pb20).  The  linear  fit  is  shown  and  it  can  be  observed  how 
the  errors  distribute  about  the  fitted  line.  A  bias  is  present  but  it 
is  negligible. 
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Figure  1.  mv%  rms  error  versus  cf  noise  level  at  different  SAR  data 
configuration,  IEM  and  GO  simulated  data,  inversion  without  a  priori 
information. 
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Figure  2.  mv%  rms  error  versus  guess  quality  level  for  different  standard 
deviation  cf  of  the  noise  IEM  and  GO  simulated  data,  HH&VV  23°  SAR 
data  configuration. 
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Figure  3.  Scatter  plot  of  estimated  versus  expected  mv%. 


In  summary,  provided  that  the  total  error  budget  C"  is  less 
than  1.5  dB,  and  that  the  a  priori  information  for  soil  moisture 
is  correct  within  20%  of  their  whole  variability  range,  the  soil 
moisture  Amv%  can  be  estimated  with  a  rms  error  of  about  5% 
cm3  cm'3  by  using  HH&VV  C-band  SAR  data  at  23°  incidence. 

IV.  CONCLUSIONS 

In  this  paper,  the  soil  moisture  retrieval  accuracy  from 
ASAR  measurement  over  soils  with  a  large  roughness 
variability  have  been  investigated. 

The  study  considered  SAR  data  configurations 
corresponding  to  data  collected  by  sensors  on  board  of  the 
ENVISAT  satellite.  Data  were  simulated  by  IEM  and  GO 
models,  perturbed  by  Gaussian  noise.  Soil  moisture  content  has 
been  retrieved  by  using  an  optimization  method  assimilating  a 
priori  information  of  surface  parameters. 

Measurement  as  well  as  direct  model  errors  have  been 
considered  and  the  retrieval  performance  have  been  estimated 
as  a  function  of  the  a  priori  information  on  the  soil  surface 
state. 

The  developed  algorithm  demonstrates  that  by  using  a 
constrained  optimization  technique,  which  appropriately 
assimilates  a  priori  information  on  soil  parameters,  it  is  feasible 
to  retrieve  soil  moisture  with  an  rms  error  of  about  5%  cmJ 
cm"3,  from  HH  and  VV  SAR  backscatter  at  relatively  small 
incidence  angles  (i.e.  approximately  between  20°  and  35°) 
provided  sufficiently  accurate  (i.e.  within  20%  of  their  whole 
variability  range)  a  priori  information  on  surface  soil 
parameters  is  available. 
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